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ABSTRACT 


A  statistical  technique  is  explained,  derived  and  illustrated.  It  is  believed  that 
a  study  of  the  values  of  the  terms  of  the  assumed  mathematical  model  (or  the  graphs 
therefrom)  would  be  of  great  informational  help  to  the  scientist  in  interpreting  his 
results.  Analysis  of  variance,  components  of  variance  analysis,  and  the  estimation 
technique  described  herein  applied  to  a  properly  designed  experiment  will  go  far 
toward  extracting  all  the  relevant  information  from  a  sample . 

Estimation  formulas  for  the  terms  of  the  assumed  mathematical  model  are  de¬ 
rived  for  two  of  the  most  used  statistical  experimental  designs,  the  factorial  and  the 
orthogonal  squares.  The  method  is  illustrated  on  a  three -variable,  hypothetical  dose 
recovery  experiment. 

It  is  pointed  out  that  the  method  gives  one  a  handle  on  measuring  quantitatively 
the  separate  and  joint  contributions  to  the  observed  effect,  of  variables  otherwise  hope¬ 
lessly  correlated  and  overlapping. 
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I.  INTRODUCTION 


Considerations  like  those  discussed  herein  tend  to  reduce  the  material  require¬ 
ments  of  research  projects  as  the  methods  and  implications  of  such  planning  and 
analyses  become  more  familiar.  It  becomes  a  case  of  more  prior  thinking  and  plan¬ 
ning  for  the  experiment,  and  less  data.  One  hardly  needs  to  point  out  that  the  number 
of  experimental  animals  to  be  housed  and  cared  for  and  personnel  and  equipment  to 
process  them  for  data  acquisition  are  of  prime  consideration  in  setting  space  require¬ 
ments  in  the  Armed  Forces  Radiobiology  Research  Institute  (AFRRI)  laboratories. 
Faced  with  known  space  limitations,  it  is  imperative  that  we  do  the  best  we  can  to 
maximize  the  amount  and  definitiveness  of  experimental  results. 

It  is  quite  likely  true  that  the  most  definitive  experiments  to  test  for  rejection  of 
alternative  branch  routes  up  the  tree  of  hypotheses,  making  up  the  bases  of  a  scienti¬ 
fic  field,  are  far  more  logical  in  character  than  mathematical.  Nevertheless,  much 
AFRRI  research  will  involve  the  next  stage,  which  is  to  measure  and  quantify  proper¬ 
ties  or  effects  already  known  or  suspected  to  exist.  Considering  the  interrelationships 
of  variables,  this  is  no  easy  task. 

A  systematic  approach  to  the  latter  stage  is  to  set  up  a  general  mathematical 
model  considered  appropriate  to  the  situation,  test  its  components  for  existence  and 
consequent  specialization  of  the  model,  and  finally,  estimate  the  numerical  values  of 
parameters  of  the  model.  This  procedure  represents  efficient  use  of  information  be¬ 
cause  the  models  represent  a  common  means  of  determining  facts  from  experiment 
and  for  suggesting  new  angles  requiring  investigation.  This  paper  introduces  the 
scientist  to  a  small  region  of  mathematical  statistics  germane  to  the  latter  procedure . 
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II.  THE  PROBLEM 


In  many  statistically  designed  multivariate  experiments,  analysis  of  variance 
gives  the  appropriate  significance  tests  of  the  effects  of  the  variables  and  interactions. 
The  experimenter  may  then  wish  to  display  his  findings  tabularly  and/or  graphically. 

It  would  seem  that  this  display  is  not  necessarily  just  to  show  the  observed  results  in 
another  manner,  but  that  the  display  should  imply  the  fundamental,  underlying  basic 
relationships  of  which  his  test  data  are  merely  a  sample.  The  aforementioned  analysis 
of  variance  gives  the  experimenter  reasonable  assumptions  as  to  the  specialized  form 
of  the  mathematical  model  describing  these  basic  relationships. 

This  methodological  paper  shows  how  one  can  find  the  various  terms  of  the  math¬ 
ematical  model.  They  are  least  squares  estimates.  Then  from  these  estimates  the 
experimenter  can  show  values  or  graphs  having  the  properties  assumed  or  learned  from 
the  analysis  of  variance,  and  which  also  fit  his  data  best  in  the  sense  of  least  squares. 

III.  DISCUSSION 

We  restrict  ourselves  to  'fixed'  rather  than  'random'  variables.  Thus  a  dose 
variable  may  have  four  assigned  values  -  they  are  not  randomly  selected.  Also,  for 
the  moment  consider  a  complete  factorial  design  illustrated  as  follows. 

Let  variable  A  denote  the  dose  rates  for  a  set  of  experiments.  For  instance,  one 
might  let  =  10  rad/min  and  a2  =  100  rad/min.  The  limit  for  the  high  dose  rate  would 
be  the  pulse  mode  of  operation  of  the  reactor.  Variable  B  denotes  the  dose  delivered 
in  one  period  of  fractionization.  bj^  =  20  rads,  b2  =  30  rads,  bg  =  40  rads. 
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The  number  of  fractionated  doses  is  to  be  a  constant  k.  A  graphic  presentation 


of  the  exposures  for  a^b^  and  a2b2  is  shown  below. 
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Figure  1.  Graph  of  Exposures 


The  integrated  dose  at  each  condition  ai5  bj  is  simply  bjk. 

All  possible  combinations  of  levels  of  the  variables  are  used.  At  each  of  the 
2x3  =  6  possible  combinations  of  A,  B,  an  LD30/3O  (=  y)  will  be  obtained.  (If  several 
values  are  found  at  each  condition,  the  method  of  estimation  described  herein  will  be 
unchanged,  but  one  considers  only  the  averages  at  each  condition. ) 

It  has  been  found  that  a  practical  and  satisfactory  general  representation  of  the 
combined  several  effects  is  of  the  form 

(1)  yy  =M  +  «i  +  £3  +  Wij  +  eij  • 

Let  subscript  i  denote  the  level  of  variable  A,  thus  i  =  2  means  100  rad/min.  Similar¬ 
ly,  subscript  j  denotes  the  level  of  variable  B,  thus  j  =  1  means  20  rad  dose  per  frac- 
tionization  period.  The  terms  in  (1)  represent  independent  contributions  to  the  total 
y  value  at  aj,  bj .  Each  term  on  the  right  but  the  last  one  is  an  underlying  average 
effect  or  value.  /J,  a j,  /Sj,  are  all  fixed  numbers,  constants,  which  will  be  esti¬ 

mated  from  the  data.  One  or  more  might  be  zero,  depending  on  what  one  learned  from 
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the  analysis  of  variance  on  the  data  in  the  first  place,  or  could  assume  from  past  ex¬ 
perience  or  knowledge.  \jl  is  a  general  average  of  y  over  the  set  of  the  independent 
variables  A  and  B,  common  to  all  y  values,  a ^  is  a  main  effect  (A)  and  can  be  so  re¬ 
lated  to  the  fj,  that  $  =  0.  Condition  a^  causes  a  contribution  of  a i  to  y13,  for  example. 

Similarly  S  /3j  -  0  and,  for  example,  condition  b3  causes  a  contribution  of  to  y^3. 

(a^)ij  is  called  an  interaction  term,  and  when  it  exists  causes  a  special  contri¬ 
bution  to  y.  Thus  the  value  (o^)13  will  be  part  of  the  sum  of  terms  making  up  the  total 
value  of  y13>  If  there  is  no  interaction  then  by  definition  all  such  terms  are  zero.  It 
is  further  assumed  that  summing  over  any  one  interaction  subscript  while  holding  the 
other  subscripts  constant  on  any  of  their  allowed  values  is  zero.  Thus  (ujS)13  +  (ojS)23 
=  0  in  the  example . 

ey  is  a  random  normal  variate  with  mean  zero  and  variance  generally  labeled 

2 

ex  and  estimated  from  the  experimental  data  by  the  analysis  of  variance. 

At  this  time  we  will  derive  the  estimates  of  the  terms  in  (1)  for  the  general  two- 
variable  factorial  case.  By  specializing  the  number  of  levels  of  the  variables  to  two 
and  three,  respectively,  the  results  will  be  appropriate  to  the  preceding  example. 
Obtaining  the  Least  Squares  Estimates  of  the  Values  in  (1) 

Let  my  =ju  +  +  (0j8)y  . 

Then  (1)  becomes 


(2) 


yy  =  my  +  ey 


Let  the  number  of  levels  of  A  and  B  be  and  X2,  respectively.  Form  the  ex¬ 
pression 
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(3) 


A  =  £  (y^  -  rn-)2  -  s.  e«  . 

i=l,...,A!  1J  ]  i.J  J 

j=l,  •  .^2 


A  true  minimum  exists  at  the  necessary  conditions  obtained  by  setting  the  par- 
tials  dA/tyx,  dA/dc^,  dA/d)Sj,  dA^  (ajS)^  equal  to  zero.  One  of  these  will  be  written  out 
in  detail. 

dA/do^  =  -2  S  <yii  -  M  -  -  0.  -  (Q0)ij)  =  0. 

j 

Then 

&2ai  =  ^  yii  ~  2 ^  ~  ?  ~  ?  (QjS)ii  • 

j  J  J  J  J 

But  by  assumption,  the  last  two  terms  are  zero.  Therefore, 


■i  =  t'  5  yij  -X  ■ 

When  one  considers  that  he  is  solving  a  system  of  linear  equations  in  terms  of 
the  y.. ,  the  results  on  /!,  c^,  0*,  (ojS)-  are  clearly  estimates  and  are  customarily 

J  AJ 

labeled  jLl ,  c^,  etc.  Thus 

“i  =  ^  yij  "  ^  ’ 

In  fact,  the  entire  set  of  four  answers  will  be  found  to  be 

^  1 

M  “  hh  i  j  y‘i  ' 

“i  =  ^  5y«  i  =  1 . d 

i-i.  —  .h 

^'ij  =  yy  -M  -?j  ■ 

These  are  the  estimates  of  the  effects  due  to  the  various  levels  of  the  individual 
variables  and  also  when  taken  jointly.  As  an  example,  let  us  interpret  the  effect  of 


r 


(4)  < 
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b‘2  on  yi2-  First  we  must  recognize  that  each  value  y^j  has  average  value  fj,  except  for 
the  addition  to  /i  of  the  several  other  effects  appearing  in  (1) .  The  presence  of  level  b2 
increases  algebraically  the  average  y\2  fromM  to/i  +  /S2 *  However,  when  interaction 
exists,  b2  exerts  an  additional  effect  along  with  level  a-^  to  further  increase  (algebrai¬ 
cally)  the  average  y12  from  fj,  +  /S2  to/i  +  02  +  (a/3)12.  A  similar  statement  shows  that 
level  ai  increases  average  y12  from  /j  +  02  +  W12  to/i  +  02  +  (o0)12  +  a1.  Its  effect 
jointly  with  b2,  namely,  (a/3)22>  has  already  been  added  in.  Since  we  have  only  esti¬ 
mates  of  each  of  these  quantities,  in  practice  the  statements  will  read  somewhat  as 
follows:  the  presence  of  level  b2  is  to  increase  (algebraically)  the  average  y  from  esti¬ 
mated  /i  to  estimated  (/i  +  / S2),  i.e.  ,  from  /i  to  M  +"^2- 

If  one  turns  back  to  the  definition  of  mjj  he  sees  that  it  is  the  mean  or  underlying 
population  average  value  at  a^,  bj  without  the  variation  due  too  of  the  individual  read¬ 
ings.  That  is  likewise  an  estimate  that  is  quite  useful.  It  can  be  found  by  replacing 
each  component  or  term  by  its  estimate  as  just  found.  So  the  estimate  of 

=  /j  +  +  &  +  (oj8)y  becomes 


(5)  my  =/ii +tti  +  j8j  +  (Qj8)y  . 

All  the  above  has  been  carried  out  in  a  generalization  to  n  variables  with 
•  •  •  >  •V  levels,  respectively.  The  results,  only,  will  now  be  stated. 

The  generalized  m^  .1  has  the  expected  appearance  but  with  a  slight  change  of 
notation  to  oq  =  01,  a2  =  0,  0-3, ...  ,0^. 

(6)  .  .in  +  <al)i1  +  <a2)i2  +  •  •  •  +  (4“2)i;Li2  +  •  •  •  +  (oJ*r^Vili2.  .  ,in 

wherein  all  possible  combinations  of  subscripts  appear  (unless  known  or  shown  to  be 
zero) . 
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The  method  of  obtaining  the  preceding  estimates  on  the  right  side  of  the  equation 


turns  out  to  be  a  recursion  formula  in  terms  of  estimates  having  fewer  subscripts. 
Let  IU  =  ^1^2  •  •  •  Then 


yifi2-  •  •  Vr+l-  •  -  in 


Actually,  all  possible  combinations  of  the  o^, . .  .  ,ar  but  less  than  r  in  number 
are  to  be  used,  and  all  with  minus  signs. 

By  the  above  methods  one  can  obtain  least  squares  estimates  of  the  parameters 
supposedly  describing  an  underlying  system.  Thus  if  analysis  of  variance  of  a  factori¬ 
al  experiment  indicates  that  only  main  effects  and  no  interactions  are  significant,  one 
generally  assumes  that  any  families  of  curves  depicting  the  underlying  facts  are  sen¬ 
sibly  parallel  systems.  The  observed  values  will  of  course  not  give  rise  to  such  a 
parallel  appearance,  so  it  seems  reasonable  to  present  to  the  reader  a  set  of  curves 
having  the  parallel  property  and  also  fitting  the  data  closest  in  the  sense  of  least 
squares.  If  the  first  order  interaction  is  found  not  to  be  zero,  then  likewise  the 
(nonparallel)  systems  of  curves  can  be  found  from  the  estimated  parameters,  which 
curves  contain  the  assumed  interaction  effects  and  are  still  best  fitting  to  the  observed 
data  in  the  sense  of  least  squares. 
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IV.  ORTHOGONAL  SQUARES  DESIGNS 


When  it  is  known  that  there  is  no  interaction  among  the  independent  variables, 
the  orthogonal  squares  designs  may  be  used.  These  can  involve  essentially  the  same 
large  coverage  of  both  numbers  of  variables  and  levels  of  each  but  requiring  only  a 
fraction  as  many  experimental  points  or  values.  Of  course, 'the  total  information, 
which  will  include  precision  of  estimates  and  power  of  significance  tests,  is  corres¬ 
pondingly  reduced.  Under  the  preceding  circumstances,  formulas  similar  to  (4),  (5), 
(6),  and  (7),  without  any  interaction  terms  and  with  all  variables  having  the  same 
number  of  levels,  Aj,  for  example,  as  required  by  the  experimental  design,  are  still 
valid. 

Example 

It  is  one  of  the  orthogonal  squares  designs,  called  a  Latin  square,  which  we 
shall  apply  to  a  hypothetical  experiment  to  estimate  fractionated  dose  recovery  effects. 
In  setting  up  an  actual  experiment,  much  consideration  should  be  given  to  the  pros  and 
cons  of  using  a  factorial  versus  orthogonal  squares  or  other  design,  since  so  much  de¬ 
pends  on  the  existence  or  nonexistence  of  interactions.  However,  we  are  now  about  to 
introduce  an  experiment  merely  as  a  vehicle  to  illustrate  properties  of  orthogonal 
squares  designs.  The  experimental  design  itself  will  appear  naturally  in  the  course  of 
describing  the  experiment  somewhat  in  detail. 

The  Discussion  of  the  Experiment 

It  is  desired  to  estimate  the  effect  of  dose  delivered  in  one  period,  number  of 
such  doses  and  time  between  doses,  on  recovery.  Assume  a  fixed  dose  rate  for  all 
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exposures.  Let  the  LD-Q/3Q  be  the  immediate  measure  of  dose  effect,  i.e,  ,  an  in¬ 
crease  in  the  LD5Q/3Q  indicates  a  repair  mechanism,  although  other  effects  could  be 
used,  e.g.  ,  mitotic  index,  and  the  same  analyses  performed  and  kinds  of  conclusions 
drawn . 

Procedure 

Determine  the  LD^q^q  for  a  single  dose  at  the  dose  rate  to  be  used  during  the 
exposures,  if  a  good  estimate  is  not  already  known.  Call  it  A  . 

A  group  of  animals,  say  50,  will  be  given  a  partial  or  fractionated  dose  of  A/k 
followed  by  a  stated  time  interval.  Then  another  partial  dose  of  the  same  size  is  given. 
This  is  repeated  a  specified  number  of  times.  After  this  same  time  interval  has 
elapsed,  the  remaining  animals  are  used  to  find  the  residual  LD5Q/3Q. 

Some  animals  may  have  died  before  this  last  phase .  We  state  our  interest  to  be 
in  those  that  survive  to  this  point.  Since  animals  dying  before  this  stage  tend  to  con¬ 
found  or  bias  the  interpretation  of  the  results,  it  seems  that  the  total  stress  or  dose  to 
this  point  should  be  kept  small  enough  to  minimize  such  effects. 

We  define  and  estimate  recovery  as  follows.  Let  P  denote  a  point  in  time  rela¬ 
tive  to  which  the  LD5Q/3Q  is  to  be  obtained  in  the  experimental  setup.  This  is  of 
course  after  a  prescribed  number  of  alternate  doses  and  intervals.  Let  the  sum  of 
fractionated  doses  to  that  point  equal  d  and  the  LD-Q/3Q  at  P  for  pre irradiated  animals 
at  P  be  Af.  Thus  under  these  conditions  a  total  dose  of  d  +  Af  is  expected  to  be  admin¬ 
istered  at  P  to  achieve  LD5Q/3Q.  But  for  a  single  dose  the  LI? yo/30  =  A.  So 
d  +  Af  -  A  =  the  excess  dose  required  under  the  conditions  of  P. 
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(d  +  A’  -  A)/d  =  the  fraction  of  the  total  dose,  just  prior  to  administering  the 
additional  A'  dose,  which  is  recovered,  or  from  which  recovery  is  made.  For  example, 
if  A'  =  A,  there  is  complete  recovery,  and  if  d  +  A'  =  A,  there  is  no  recovery. 

Let  the  fraction  recovered,  (d  +  A'  -  A)/d,  be  found  at  each  of  the  experimental 
conditions,  P.  This  will  be  the  main  dependent  variable  y  in  the  analysis. 

Let  us  now  define  the  variables,  recalling  that  in  this  design  all  variables  must 
have  the  same  number  of  levels. 

Let  dose  A  have  four  levels,  e.g.  , 

a-^  A/5,  a 2  2A/5,  ag  3A/5,  a^  4A/5. 

Let  B  denote  the  number  of  fractionated  doses, 
bf  =  1,  b2  =  2,  b3  =  3,  b^  =  4. 

Let  C  denote  interval  in  days  between  fractionated  doses. 

C1  =  C2  ~  2,  Cg  =  3,  c^  =  4. 

Maximum  information  would  be  obtained  by  subjecting  a  group  of  animals  to 
each  of  all  possible  conditions  a^,  bj ,  c^,  a  total  of  4'  =  64  groups.  Since  that  many 
groups  are  considered  to  be  a  prohibitive  expenditure  of  animals  and  time,  it  is  pro¬ 
posed  as  mentioned  previously,  to  use  an  experimental  design  known  in  the  trade 
as  a  Latin  square ,  which  will  use  only  16  of  the  above  conditions .  An  example  of 
such  a  design  follows. 
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Figure  2.  Latin  Square  Experimental  Design 


The  circled  position  denotes  condition  a2,  b2,  c^  y,  the  fraction  of  total  dose 
recovered  (as  defined  previously)  will  be  found  at  all  16  conditions.  Then,  under  the 
assumption  of  additivity  (i.e.  ,  zero  interactions),  one  can  estimate  the  y  values 
corresponding  to  all  the  remaining  conditions,  as  explained  in  the  main  text  later. 
Note  the  symmetry  in  the  above  configuration  in  that  each  c  subscript  occurs  once  and 
therefore  only  once  in  every  row  and  column. 

Qualitative  sketches  of  traces  of  the  functional  form  would  probably  appear  as 
below  in  Figure  3. 


y  > 

1  '  \ 

1 

A  FOR  FIXED  B,  C 

B  FOR  FIXED  A,  C 

C  FOR  FIXED  A,  B 

Figure  3.  Qualitative  Traces  of  Function  y 
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With  these  64  values ,  one  is  in  position  to  attempt  curve  and  surface  fitting 
y  -f(A,  B,  C)  in  order  to  descriptively  mathematize  the  results. 

If  some  measurable  and  meaningful  effect  on  the  individual  animal  could  be  used 
as  a  measure  of  the  recovery  effect  instead  of  the  animal  devouring,  time  consuming 
LD50/30  t°  y,  one  could  afford  perhaps  to  go  to  the  complete  factorial  design  and  thus 
test  the  significance  of  the  possible  interactions.  In  this  case  the  number  of  levels  of 
the  variables  need  not  be  the  same.  Or  one  might  use  larger  animals  in  the  experi¬ 
ment. 

Now  in  the  same  example  experiment  it  would  be  possible  to  include  a  new  vari¬ 
able  D  which  is  the  dose  rate.  Let  d]^  denote  10  rad/min,  d2  denote  100  rad/min. 
Another  orthogonal  squares  experimental  design  could  then  be  used  as  follows. 


al  a2  a3  a4 


b 

b 

b 

b 


1 

2 

3 

4 


cldl  c2d2 

c2d3  cld4 
c3d4  c4d3 
c4d2  c3di 


c3d3  c4d4 
c4dl  c3d2 
cld2  c2dl 
c2d4  cid3 


Figure  4.  Graeco-Latin  Square  Experimental  Design 


Now  let  dg  =  dp  d^  =  d2,  for  example,  since  there  are  only  two  distinct  levels  of  this 
D  variable . 

Consider  any  letter  and  subscript,  e.g.  ,  dp  It  occurs  once  and  only  once 
with  every  other  letter -subscript  combination.  This  design  also  requires  only  16 
animal  groups  as  did  the  previous  one.  In  the  same  manner  the  remaining 
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4  -  16  -  256  -  16  -  240  values  could  be  filled  in.  However,  do  not  be  misled  into 
thinking  there  is  more  independent  information  in  the  256  points  than  in  the  16  points. 
It  is  just  that  the  completed  display  may  suggest  some  otherwise  unperceived  informa 
tion. 


We  now  need  the  estimation  formulas  for  this  case. 

Derivation  of  the  Estimation  Formulas 

The  derivation  and  results  are  essentially  the  same  as  for  the  factorial  case. 


The  differences  are  in  the  missing  interaction  terms  like  (aj3)i;j,  and  summations  like 


S  y-jjk  which  are  restricted  to  the  i,  j,  k  combinations  displayed  in  the  experimental 
J  J  ^ 

design.  In  fact,  just  as  an  explicit  reminder  we  write  such  summations  as  S’  ykk. 

j,k  J 


The  prime  (')  is  to  remind  us  that  the  i,  j,  k  values  are  restricted  to  combinations 
found  in  the  experimental  design. 

Again  we  assume  that 


(8) 


mijk  ^  +  /3j  +  yk  , 


but  with  no  interaction  terms. 


i,  j,  k  -  1,  .  .  . ,  subject  to  the  restriction  that  in  combination  they  represent 

one  of  the  conditions  of  the  experimental  design.  By  analogy  with  the  4-level  Latin 

2 

square  displayed  previously,  this  subset  of  combinations  is  in  number  out  of  the 
3 

possible  i-y  total  combinations. 

^  yijk  -  mijk  +  eijk  » 

O 

where  eyk  is  again  a  normal  variate  with  zero  mean  and  variance  cr  (usually  unknown) 
We  seek  to  minimize 

A = jk  (y«k  ■  m«k)2 = j;k  ^ yk>2  ■ 
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Set 


9A 

^‘='2i,f’k<yiik'4'“i',3i'yk)  =  0- 

Each  of  a^,  /3j,  y^  are  summed  over  their  values  and  therefore  by  assumption  each 
adds  up  to  zero,  the  same  assumption  we  made  in  the  factorial  design.  If  the 
“l  +  °2  +  ’  ’  ’  +  a&i  =  1  f°r  example,  we  would  just  diminish  each  by  0. 1/ and 

add  0. 1  to  the^.  It  would  not  affect  their  differences,  and  it  is  only  their  differences 
that  really  count.  So 

^-4—. 

or  the  least  squares  estimate  of  fj,  is 

a°)  7  -  \ .  &  yiik  ■ 

Next 


=-2  ft  ,-firyk>-°. 

1  J  J  ^ 

■s'  yyk  -  -  hai =  0 

J,k  J 

since  again  S'  &  =  0  and  S'  y^  -  0,  each  being  summed  over  its  respective  total  set 
j.k  J  j,k 

of  values.  Therefore, 

(11)  3  =  j-  a  yIjk  -7 

1  J,k  J 

where  \Jl  must  be  replaced  by  its  estimate  in  the  solution. 


Following  exactly  similar  argument, 


(12) 

(13) 


^  *liSkyijk~M 
1 


yk  SL1  yijk  M  • 

The  least  squares  estimate  of  any  particular  the  underlying  true  mean  at 

the  a^,  bj ,  c^  combination  of  levels,  whether  or  not  this  combination  occurs  in  the 
experimental  design  used,  would  be  obtained  by  substituting  component  estimates  like 
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(10),  (11),  (12)  and  (13)  into  (8),  giving 


(14)  mijk=(i+ai  +  ^j+'>/k  • 

Numerical  Computations 

Let  us  place  some  hypothetical  numerical  values  for  y  in  the  four-level  three- 
variable  Latin  square  table  displayed  in  the  section  on  the  recovery  experiment.  The 
y  values  are  percent  recovery  and  are  displayed  in  the  table  below. 


Table  I.  Experimental  Values 


al 

a2 

a3 

a4 

bl 

C1 

72.2 

c2 

74.5 

c3 

75.9 

c4 

68.0 

b2 

c2 

74.0 

C1 

58. 1 

c4 

66.1 

c3 

50.9 

b3 

c3 

77. 1 

c4 

62.4 

C1 

26.9 

c2 

26.7 

b4 

c4 

56.9 

c3 

42.8 

c2 

20.5 

C1 

1.  8 

^  1  „  „  854.8 

M  ig  (72.2  +  74.5  +  ...  +1.8)-  -^g  —  53 . 425  . 

Using  equations  (11),  (12),  (13)  one  obtains  Table  II.  As  an  example 

72  =  ^  (74.  5  +  74.0  +  26.7  +  20.5)  -  53.425  =  -4.500  . 
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Table  II.  Estimates  of  o^,  ySj ,  y ^ 


i 

«i 

£ 

n 

i 

16.625 

19.225 

-  13.675 

2 

6.025 

8.850 

-  4.500 

3 

-  6.075 

-  5.150 

8.250 

4 

-  16.575 

-  22.925 

9.925  l 

We  are  now  ready  to  fill  out  the  entire  table  of 

£ijk=M+ 

if  we  so  desire.  For  example, 

m124  +  o1+  /§2  +  T4  =  53.425  +  16.625  +  8.850  +  9.925  =  88.825. 

See  Table  III  for  the  complete  set  of  entries. 


Figure  5.  Graphs  of  m24k, 
m41k  as  Function  of  C. 


Next  let  us  graph  (estimate  of  true  y  aver¬ 

age)  as  a  function  of  interval  between  doses  (C),  for 
level  2  of  dose  (A),  and  for  level  4  of  (B)  which  is 
four  doses.  The  values  to  be  graphed  are  m^j.,, 
k  =  1,  2,  3,  4.  They  can  be  read  from  Table  III.  For 
comparison  let  us  also  graph  corresponding  to 

one  dose  applied  at  the  highest  level  (Figure  5).  These 
values  are  shown  in  Table  IV.  It  is  evident  that  a 
large  number  of  such  choices  could  be  made,  includ¬ 
ing  even  plotting  a  surface  m^  as  a  function  of  two 
of  the  variables  for  a  fixed  value  of  the  third  one . 
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Table  III.  Estimates  of  m^ 


i  J  k 


m 


ijk 


i  j  k 


mijk 


75.600 
84. 775 
97.525 
99. 200 


52.900 

62.075 

74.825 

76.500 


65.225 
74.400 
87. 150 
88.825 


42.525 
51.700 
64.450 
66. 125 


51. 225 
60.400 
73. 150 
74.825 


28.525 
37.700 
50.450 
52. 125 


14  1 

14  2 


33.450 
42. 625 
55. 375 
57. 050 


341  10.750 

342  19.925 

343  32.675 

344  34.350 


65. 000 
74. 175 
86. 925 
88. 600 


42.400 
51.575 
64. 325 
66. 000 


54. 625 
63. 800 
76. 550 
78. 225 


421  32.025 

4  2  2  41.200 

53.950 
424  55.625 


40. 625 
49. 800 
62.550 
64.225 


18. 025 
27.200 
39.950 
41. 625 


22. 850 
32. 025 
44. 775 
46.450 


0.250 
9.425 
22. 175 
23.850 


The  circled  combinations  appeared  in  the  experiment. 
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Table  IV.  Estimates  of  1^24^  and  0143^ 


k 

m24k 

“41k 

1 

22.850 

42.400 

2 

32.025 

51.575 

3 

44.775 

64.325 

4 

46.450 

66.000 

For  interest,  the  experimental  values  of  the  only  two  conditions  corresponding  to 
points  above  are  read  from  Table  I  and  plotted.  They  are  y(a2,  b4,  C3)  =42.8  and 
y(a4,  bp  c4)  =  68.  0.  Equivalent  notation  would  let  us  write  these  as  y243  =  42.8  and 

Y414  = 68.0. 

The  subject  should  not  be  closed  without  pointing  out  another  possible  and  impor¬ 
tant  application.  This  is  in  estimating  the  effects  of  combined  radiation,  thermal  and 
blast  stresses  on  the  organism.  Note  that  this  would  be  a  three-variable  system  in  the 
simplest  experiment.  A  factorial  or  fractional  replication  factorial  to  estimate  inter¬ 
action  terms  would  surely  be  required.  Again  the  estimated  terms  in  equations  like 
(6)  and  (7)  are  the  required  effects;  (6)  gives  the  combined  effects  and  (7)  gives  the 
separate  effects  of  main  variables  and  interactions.  It  would  require  an  experimental 
environment  wherein  different  prescribed  levels  of  the  three  types  of  insult  could  be 
administered,  either  simultaneously  or  nearly  so. 
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